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Abstract 

A directional signal processing system for beamforming information signals. The 
system includes an oversampled filterbank, which has an analysis filterbank for 
transforming the information signals in time domain into channel signals in transform 
domain, a synthesis filterbank and a signal processor. The signal processor processes the 
outputs of the analysis filterbank for beamforming the information signals. The synthesis 
filterbank transforms the outputs of the signal processor to a single information signal in 
time domain. 
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DIRECTIONAL AUDIO SIGNAL PROCESSING 
USING AN OVERSAMPLED FILTERBANK 

5 FIELD OF THE INVENTION 

The present invention relates to audio signal processing applications where the 
direction of arrival of the audio signal(s) is the primary parameter for signal 
processing. Hie invention can be used in any application that requires the input audio 
signal(s) to be processed based on the spatial direction from which the signal arrives. 

1 0 Application of this invention includes, but is not limited to, audio surveillance 

systems, hearing aids, voice-command systems, portable communication devices, 
speech recognition/transcription systems, and any application where it is desirable to 
process signai(s) based on the direction of arrival. 

BACKGROUND OF THE INVENTION 

1 5 Directional processing can be used to solve a multitude of audio signal 

processing problems. In hearing aid applications, for example, directional processing 
can be used to reduce the environmental noise that originates from spatial directions 
different from the desired speech or sound, thereby improving the listening comfort 
and speech perception of the hearing aid user. In audio surveillance, voice-command 

20 and portable communication systems, directional processing can be used to enhance 
the reception of sound originating from a specific direction, thereby enabling these 
systems to focus on the desired sound. In other systems, directional processing can be 
used to reject interfering signal(s) originating from specific direction(s), while 
maintaining the perception of signal(s) originating from all other directions, thereby 

25 insulating the systems from the detrimental effect of interfering signal(s). 

Beamforming is the term used to describe a technique which uses a mathematical 
model to maximise the directionality of an input device. In such a technique filtering 
weights may be adjusted in real time or adapted to react to changes in the 
environment of either the user or the signal source, or both. 



CA 02397009 2002-08-07 



I r 

2 

Traditionally, directional processing for audio signals has been implemented 
in the time-domain using Finite Impulse Response (FIR) filters and/or simple time- 
delay elements. For applications dealing with simple narrow band signals, these 
approaches are generally sufficient. To deal with complex broadband signals such as 
S speech, however, these time-domain approaches generally provide poor performance 
unless significant extra resources, such as large microphone arrays, lengthy filters, 
complex post-filtering, and high processing power are committed to the application. 
Examples of these technologies are described in "Analysis of Noise Reduction and 
Dereverberation Techniques Based on Microphone Arrays with Postfiltering", C. 
10 Marro, Y. Mahieux and K. U, Simmer, IEEE Trans, Speech and Audio Processing, 
vol. 6, no. 3, 1998, and in "A Microphone Array for Hearing Aids", B. Widrow, 
IEEE Adaptive Systems for Signal Processing, Communications and Control 
Symposium, pp.7-1 1, 2000. 

In any directional processing algorithm, an array of two or more sensors is 
15 required. For audio directional processing, either omni-directional or directional 

microphones are used as the sensors, Figure 1 shows a high-level block diagram of a 
general directional processing system. As seen in the figure, while there are two or 
more inputs 100, 105 to the system 1 10, there is generally only one output 120. 

There are two common types of directional processing algorithms: adaptive 
20 beamforming and fixed beamforming. In fixed beamfbrming, die spatial response -or 
beam pattern - of the algorithm does not change with time, as opposed to a time- 
varying beampattern in adaptive beamforming. A beampattern is a polar graph that 
illustrates the gain response of the beamforming system at a particular signal 
frequency over different directions of arrival. Figure 2 shows an example of two 
25 different beampatterns in which signals from certain directions of arrival are 

attenuated (or enhanced) relative to signals from other directions. The first is the 
cardioid pattern 200, typical of some end-fire microphone arrays, and the other 205 is 
the beampattern typical of broad-side microphone arrays. Figure 3 illustrates typical 
configurations for end-fire 300, 305, 310 and broadside 320, 325, 330 microphone 
30 arrays. 
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More recent Fast Fourier Transform (FFT>based approaches attempt to 
improve upon the traditional time-domain approaches by implementing directional 
processing in the frequency-domain. However, many of these FFT-based approaches 
suffer from wide sub-bands that are highly overlapped, and therefore provide poor 
5 frequency resolution. They also require longer group delays and more processing 
power in computing the FFT. 

Accordingly, there is a need to solve the problems noted above and also a need 
for an innovative approach to enhance and/or replace the current technologies. 

SUMMARY OF THE INVENTION 

10 The invention described herein is applicable to both the end-fire and broadside 

microphone configurations in solving the problems found in conventional 
beamforming solutions. It is also possible to apply the invention to other geometric 
configurations of the microphone array, as the underlying processing architecture is 
flexible enough to accommodate a wide range of array configurations. For example, 

1 5 more complex directional systems based on two or threodimensional arrays, used to 
produce beampattems having three dimensions, are known and are suitable for used 
with this invention. 

In accordance with an aspect of the present invention, there is provided a 
directional signal processing system for beamforming a plurality of information 

20 signals, which includes: a plurality of microphones; an oversampled filterbank 

comprising at least one analysis filterbank for transforming a plurality of information 
signals in time domain from the microphones into a plurality of channel signals in 
transform domain, and one synthesis filterbank; and a signal processor for processing 
the outputs of said analysis filterbank for beamforming said information signals. The 

25 synthesis filterbank transforming the outputs of said signal processor to a single 
information signal in time domain. 

In accordance with a further aspect of the present invention, there is provided 
a method of processing a plurality of channel signals for achieving approximately 
linear phase response within the channel, which includes a step of performing filtering 
30 by applying more than one filter to at least one channel signal. 
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In accordance with a further aspect of the present invention, there is provided 
a method of processing at least one information signal in time domain for achieving 
approximately linear phase response, which includes a step of performing an 
5 oversampling using at least one oversampled analysis filterbank. The oversampled 
analysis filterbank applies at lease one fractional delay impulse response to at least 
one filterbank prototype window time. 

The directional processing system of the invention takes advantage of 
oversampled analysis/synthesis filterbanks to transform the input audio signals in time 

10 domain to a transform domain. Example of common transformation methods includes 
GDFT (Generalized Discrete Fourier Transform), FFT, DCT (Discrete Cosine 
Transform), Wavelet Transform and other generalized transforms. The emphasis of 
the invention described herein is on a directional processing system employing 
oversampled filterbanks, with the FFT method being one possible embodiment of said 

1 5 filterbanks. An example of the oversampled, FFT-based filterbanks is described in 
United States Patent 6,236,731 "Filterbank Structure and Method for Filtering and 
Separating an Information Signal into Different Bands, Particularly for Audio Signal 
in Hearing Aids" by R. Brennan and T. Schneider, incorporated herein by reference. 
An example of an hearing aid apparatus employing said oversampled filterbanks is 

20 described in United States Patent 6,240, 192 "Apparatus for and Method for Filtering 
in an Digital Hearing Aid, Including an Application Specific Integrated Circuit and a 
Programmable Digital Signal Processor" by R. Brennan and T. Schneider, 
incorporated herein by reference. However, this use of oversampled analysis/synthesis 
filterbanks in the general framework of the directional processing system disclosed 

25 herein has not been reported before. 

The sub-band signal processing approach described henceforth, with its 
corresponding FFT-based method being one possible embodiment of the oversampled 
filterbanks employed in the invention disclosed herein, has the advantage of directly 
addressing the frequency-dependent characteristics in the directional processing of 
30 broadband signals. Compared to traditional time-domain and FFT-based approaches, 
the advantages of using an oversampled filterbank in sub-band signal processing 
according to the present invention are as follows: 
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1 ) Equal or greater signal processing capability at a fraction of the processing 
power, 

2) Orthogonalization effect of the subband signals in the different frequency 
bins due to the FFT of the oversampled filterbank, 

3) Improved high frequency resolution, 

4) Better spatial filtering, 

5) Wide range of gain adjustment at a very low cost of processing power, and 

6) Ease of integration with other algorithms. 

As a result, the sub-band directional processing approach with an oversampled 
filterbank allows powerful directional processing capability to be implemented on 
miniature low-power devices. For applications employing the invention, this means: 

1) Better listening comfort and speech perception (particularly important for 
hearing aids), 

2) More accurate recognition for speech and speaker recognition systems, 

3) Better directionality and higher SNR, 

4) Low group delay, and 

5) Lower power consumption. 

Thus, the present invention is applicable for audio applications that require a 
high fidelity and ultra low-power processing platform. 

A further understanding of the other features, aspects, and advantages of the 
present invention will be realized by reference to the following description, appended 
claims, and accompanying drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Embodiments of the invention will now be described with reference to the 
accompanying drawings, in which: 

Figure 1 shows a block diagram of a general directional processing system; 

5 Figure 2 shows an example of two different beampatterns; 

Figure 3 shows the array configuration of the end-fire and broadside arrays; 

Figure 4 shows a block diagram of the adaptive beamformer system according 
to one embodiment of the invention; 

Figure 5 shows a block diagram of the adaptive beamformer system according 
1 0 to another embodiment of the invention; 

Figure 6 shows a traditional time-domain beamformer structure; 

Figure 7 shows a sub-band beamformer using an oversampled filterbank 
according to another embodiment of the present invention; 

Figure 8 shows another preferred embodiment modified for compensating the 
1 5 bandwidth of the sub-bands; 

Figure 9 shows another preferred embodiment modified for compensating the 
undesirable low-frequency beamformer response; and 

Figure 10 show another preferred embodiment using a neural network as a 
beamformer filter according to the invention. 

20 DETAILED DESCRIPTION OF THE INVENTION 

Turning now to Figure 4 an adaptive beamformer system embodying the 
invention in block diagram form is shown. Note that it is assumed that the outputs of 
the L microphones 400 (I > 2) are already converted to digital form by a set of 
analogue-to-digital converters (ADC) (not shown). Similarly, the output is assumed to 
25 be converted from digital form by an digital-to-analogue converter (DAC) (not 
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shown) to produce an appropriate output signal 490. The digitized outputs of the L 
microphones 400 are first combined in a combination matrix 415. The combination 
matrix 415 can be any Finite Impulse Response (FIR) filter with multiple input and 
outputs (the number of outputs M being less or equal to the number of inputs L (M <> 
5 £)). Suitable matrices include a delay-and-sum network, a sigma-delta network, and a 
one-to-one mapping of the inputs to the outputs (for example some general matrix 
through which L inputs are transformed into L (i.e. M=L) outputs)). TheM outputs of 
the combination matrix 415 are then transformed to the frequency domain by an 
analysis filterbank 420, with N sub-bands per combination matrix output to produce 

10 M x N signals for processing. The (oversampled) analysis filterbank 420 used in this 
embodiment is the weighted-overlap-add (WOLA) filterbank described in United 
States Patent 6,236, 731 "Filterbank Structure and Method for Filtering and Separating 
an Information Signal into Different Bands, Particularly for Audio Signal in Hearing 
Aids" by R. Brennan and T. Schneider. An adaptive system 460 then generates a 

15 weighted sum of the analysis filterbank outputs which are applied to the outputs by 
the multiplier 425. The weights (also known as filter taps) of the adaptive system 460 
are adapted according to well known adaptive strategies including, but not limited to, 
those based on Least Mean Squares (LMS), and Recursive Least Squares (RLS). The 
outputs of the multiplier 425 are then passed to a summer 430 which produces^ 

20 outputs, each a weighted sub-band derived from the original microphone signals. The 
overall adaptation process is further controlled by the outputs of a side process 
comprising an estimations block 450, and a post-filter adapter 455. The estimations 
block of the side process 450 may include one or more of a Voice Activity Detector 
(VAD), a Target-to- Jammer Ratio (TJR) estimator, and a Signako-Noise Ratio 

25 (SNR) estimator. The outputs of the estimations block 450 are then used to slow 

down, speed up, or inhibit the adaptation process by controlling the weight adaptation 
460, and also combined with post-filter adaptation 455 to control the post-filter 435. 
After passing through a summer 430 which combines the processed Mx N inputs 
received from the adaptive processor 460, 425 intoN sub-bands, the post-filter 435 

30 operates in the frequency domain to further process the signal depending on the 

output from the post-filter adapter 455, After post-filtering the N sub-band frequency 
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domain outputs are processed by a synthesis filterbank 440 to generate a time-domain 
output 490. 

Oversampled filterbanks offer the general advantages explained in the 
summary above by virtue of their flexibility and the fabrication technology. Further 
5 advantages of their use for the adaptive beamformer application of the present 
invention are: 

1) Directional processing using prior art techniques requires very long 
adaptive filter lengths particularly in reverberant environments, as reported by other 
researchers (see J. E. Greenberg, "Improved Design of Microphone-Anay Hearing 

10 Aids", PhD Thesis, MIT, Sept 1994). The sub-band adaptation using the 

oversampled filterbank can efficiently implement the equivalent of a long filter 
through parallel sub-band processing. 

2) In frequency domain beamforming (both adaptive and fixed), there is a 
need to weight the Fast Fourier Transform (FFT) coefficients in a highly 

1 5 unconstrained way. A typical adaptive post-filtering operation is the multiple- 
microphone Wiener filtering, in which the frequency response is adapted depending 
on the Signal-to-Noise Ratio (SNR) of the received signal. In this process, there is a 
need for unconstrained gain adjustments across the frequency bands. The 
oversampled filterbank implementation allows a wide range of gain adjustments 

20 without creating the so-called "time-aliasing" problem that happens in the critically 
sampled filterbanks. It has been observed that the operation cost is not much higher 
than the critically sampled filterbanks and much lower than the undecimated 
filterbanks. For more information see United States Patent 6,236, 731 "Filterbank 
Structure and Method for Filtering and Separating an Information Signal into 

25 Different Bands, Particularly for Audio Signal in Hearing Aids". R. Brennan and T. 
Schneider, and "A Flexible Filterbank Structure for Extensive Signal Manipulations 
in Digital Hearing Aids", R. Brennan and T. Schneider, Proc. IEEE Int. Symp. 
Circuits and Systems, pp.569-572, 1998. 

3) The so-called "Misadjustment" error, where there is excessive Mean Square 
30 Error when compared to an optimal Wiener filter, is typically present in adaptive 
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systems. It is well known and understood that sub-band and orthogonal decomposition 
reduces this problem. The oversampled filterbank used in the invention employs such 
decomposition in at least one preferred embodiment. 

4) Estimation of Target-to-Jammer Ratio (TJR) usually requires the cross- 
5 correlation of two or more microphone outputs (as described in "Improved Design of 
Microphone-Array Hearing Aids", J. E. Greenberg, PhD Thesis, MIT, Sept. 1994). 
The frequency domain implementation of the process using the oversampled 
filterbank is much fester and more efficient than the time-domain methods previously 
used. 

10 5) By using the side process outputs of the Voice Activity Detector (VAD), 

the Target-to-Jammer Ratio (TJR) estimator, and the Signal-to-Noise Ratio (SNR) 
estimator, the adaptation process can be slowed down or totally inhibited when there 
is a strong target (like speech) presence. This enables the system to work in 
reverberant environments. There are enough pauses in speech signal to ensure that the 

1 5 inhibition process does not disturb the system performance. A suitable efficient 
frequency domain VAD that uses the oversampled filterbank is described in a co- 
pending patent application "Sub-band Adaptive Signal Processing in an Oversampled 
Filterbank", K. Tarn et al., Canadian Patent Application Serial 2,354,808, August 
2001, US application serial , incorporated herein by reference. 

20 According to a further preferred embodiment of the invention, shown in 

Figure 5, the weight adaptation process is performed on a set of fifixed beams for 
each sub-band constructed or synthesised from the sub-bands derived from each 
microphone output, rather than the microphone outputs themselves or the sub-bands 
of such outputs. Within Figure 5 most of the elements ae the same as Figure 4, and 

25 have been notated with the same reference numbers. Therefore these elements will not 
be described again. The new elements introduced in this embodiment are the Fixed 
Beamformer 510 which produces B main beams from the sub-bands, and a weight 
adaptation block 520 which controls the multiplier 425, based on inputs from the 
VAD, TJR and SNR estimations block 450, and the sub-band signals output by the 

30 Fixed Beamformer 510. Generally this strategy provides a smoother and more robust 
transition when the adaptive filtering weights are changed. The weight adaptation is 
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controlled by some TJR and/or SNR estimations based on, but not limited to, one or 
more of the following signal statistics: auto-correlation, cross-correlation, subband 
magnitude level, subband power level, cross-power spectrum, cross-power phase, 
cross-spectral density, etc. One possible filtering weight adaptation strategy based on 
a simplified SNR estimation is proposed here, and other similar or related methods 
may occur to those skilled in the art, and it is our intention that these be covered. 
When the side process detects the absence (or near absence) of the target, die time- 
averaged energy of the noise in each of the beams (denoted by En(7), I=\ ,2, . . .Ji) is 
measured. When the target reappears, the time-averaged energy of the target (Et(/) ) 
and the SNR in each beam (SNR(J) ) are estimated, given the total averaged energy in 
the beam Etot(/), by: 

Et(7)=Etot(7)-En(7) , /=1,2,...,£ 

SNR(J)=Et(i)/En(/) 

If the noise statistics, and noise and target directions do not change much from 
one target signal pause to the next pause, the SNR(Q for each beam can be used to 
make a weighted sum of the beams. However, if the noise is highly nonstationary, or 
if the noise and/or target sources are moving quickly, an adaptive processor should be 
employed to adjust the weights. For improved performance, the fixed beamformer can 
be designed with a set of narrow beams covering the azimuth and elevation angles of 
interest for a particular application. 

A further embodiment of the invention in a fixed beamforming application 
will now be discussed. The classical method of implementing a fixed beamformer is 
the delay-and-sum method. Because of the physical spacing of the microphones in the 
array, there is an inherent time delay between the signals received at each 
microphone. Hence, die delay-and-sum method utilizes a simple time-delay element 
to properly align the received signals so that the signals arriving from certain 
directions can be maximally in-phase, and contribute coherently to the summed output 
signal. Any signal arriving from other directions then contributes incoherently to the 
output signal, so that its signal power can be reduced at the output. 
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With the FER-filter method, the FIR filters are generally designed so that their 
phase responses take on the role of aligning the received signals to create die desired 
beampattem. These filters can be designed using transformation from analogue filters, 
or direct FIR filter design approaches. When complex broadband signals are involved, 
5 such time-domain filter designs generally require the availability of a significant 
amount of computation power. For comparison, Figure 6 shows a fixed beamformer 
structure using the prior art time-domain approach. In the figure an array of three 
microphones 600, 601, 602 is disposed in a known pattern, although a greater number 
of microphones might also be used. The outputs of each microphone in the array 600, 

10 60 1 , 602 is passed to a separate time-delay element (or FIR Filter) 6 1 0, 6 1 1 ,6 1 2, 
whose outputs are passed in turn to a summer 620. The summer 620, when the time 
delay elements are correctly set as described above, provides an enhanced output 630 
for a particular spatial direction with respect to the microphone array. Usually, this 
setting of the time delay elements 610, 61 1,612, is accomplished dynamically, but is 

1 S often a compromise depending on the factors including the frequency of the signal, 
and the relative spacing of the microphones in the array. If a number of beams were 
required, each would be constructed or synthesised using a similar circuit. For that 
reason these systems are expensive, high in power consumption, complex and hence 
limited in application. 

20 Further preferred embodiments of the invention described herein perform a 

series of narrowband processing steps to solve the more complex broadband problem. 
The use of the oversampled filterbank allows the narrowband processing to be done in 
an efficient and practical manner. Figure 7 shows a sub-band fixed beamformer using 
an oversampled filterbank according to another embodiment of the present invention. 

25 The system is very similar to that described in Figure 4, For convenience and clarity, 
the same components are identified by the same reference numbers in both figures. 
The digital versions of the signals received at theL-microphone array 400 are 
combined through a combination matrix 415 into M signal channels Q4<*L) before 
being sent to the analysis filterbank 420. The analysis filterbank 420generates N 

30 frequency sub-bands for each channel, whereupon the beamforming filter 7 1 0 applies 
complex-valued gain factors for achieving the desired beampattem, based on inputs 
from the VAD, TJR and SNR estimation block 450, and the level of signal in thesub- 
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bands produced by the analysis filterbank 420. The gain factors can be applied either 
independently for each channel and sub-band, or jointly through all channels and/or 
sub-bands by some matrix operation. After the gain factors are applied by the 
multiplier 425, the M channels are combined to form a single channel through a 

5 summation operation 430. A post-filtering process 435 can then be applied to provide 
further enhancement as before (such as improving the SNR) making use of the side 
process 450, 455. Afterwards, the synthesis filterbank 440 transforms the single 
channel composed of N sub-bands back to time-domain. In further embodiments, the 
post-filtering is applied in the time-domain, after the signal channel is converted back 

10 to time-domain by the synthesis filterbank, although, compared to frequencjKlomain 
post-filtering, this typically requires more processing power. 

The complex-valued gain factors of the beamforming filter can be derived in a 
number of ways. For example, if an analogue filter has been designed, then it can be 
implemented directly in sub-bands by simply using the centre frequency of each sub- 

1 5 band to look up the corresponding complex response of the analogue filter (frequency 
sampling). With sufficiently narrow sub-bands, this method can create a close digital 
equivalent of the analogue filter. In a further embodiment of the invention, to closely 
approximate the ideal phase and amplitude responses for wider sub-bands, a 
narrowband filter to each sub-band output is applied as will now be described in 

20 relation to Figure 8 in which again, many of the components are the same as for the 
earlier Figure 7, and for which those same components are for convenience and clarity 
referred to by the same reference numbers. The additional function for this 
embodiment is performed in the Narrowband Prototype Filters 815. To approximate 
an ideal linear phase response of the beamformer, the filters 8 1 5 are designed as all- 

25 pass with a narrowband linear phase response. In a further embodiment, the filters are 
further constrained to being identical, and are moved back before the FFT modulation 
stage by combining its impulse response with the filterbank prototype window. One 
possible combination is a time convolution of the filterbank prototype window with a 
fractional delay impulse response. As a means of eliminating the external noise at the 

30 acoustic output stage, an Active Noise Cancellation (ANC) module is optionally 
added to the system in a manner similar to the system described in a co-pending 
patent application "Sound Intelligibility Enhancement Using a Psychoacousttc Model 
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and an Oversampled Filterbank", T. Schneider et. al., Canadian Patent Application, 

serial 2,354,755, US serial , incorporated herein by reference. Hie ANC, 

as also shown in Figure 8, consists of a microphone 820 positioned at the output 490, 
plus a loop filter 830 to provide feedback to the combination matrix 415. 

5 Almost all implementations of beam formers suffer from a low-frequency roll- 

off effect. To compensate for this effect, most systems, including the proposed 
system, introduce low-frequency amplification. However, because of the unavoidable 
microphone internal noise, this inherently leads to a high level of output noise at very 
low frequencies. As is well known, the result is that the desired beampattem can only 

10 be obtained for the frequencies above some cut-off value (usually around 1 kHz based 
on a particular microphone separation distance). In a further embodiment, shown in 
Figure 9, to avoid a high-level of low-frequency noise, the microphone signals are 
separated into high frequency and low-frequency components by high-pass filter 
(HPF) 920 and low-pass filter (LPF) 910. Again, many of the same components used 

15 in the preferred embodiment described with reference to Figure 7 are used, 

performing the same function, and are given the same reference numbers. The high 
frequency components output by the high pass filter 920 are processed by the 
beamforming filter 710, multiplier 7425, and Narrow band prototype filters 815, as 
before. The low-frequency components by-pass the beamforming filter 710, multiplier 

20 7425, and Narrow band prototype filters 815, relying solely on the post-filter 435 to 
provide low-frequency signal enhancement. 

Besides the conventional digital filter design methods, the beamformer filter 
710 in Figure 7 can also be implemented using an Artificial Neural Network (ANN). 
The ANN can be employed as a type of non-parametric, robust adaptive filter, and has 

25 been increasingly investigated as a viable signal processing approach. One further 
possible embodiment of the present invention is to implement a neural network 1010 
as a complete beamforming filter, as shown in Figure 10. Once again the same 
reference numbers as Figure 4 are used for those components that are unchanged in 
function. The neural network 1010 accepts inputs from the sub-bands output by the 

30 analysis filterbank, and uses these to control the multiplier 425 which affect those 

sub-bands. The post filter adaptor 455 in this case accepts as input the results of each 
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sub-band after the multiplier operation 425, and is again used to adapt the post 
filtering block 435. 

The Cascaded Hybrid Neural Network (CHNN), designed specifically for sub- 
band signal processing, can be used to implement a beamforming filter. The CHNN 

5 consists of two classical neural networks -the Self-Organising Map (SOM) and 
Radial Basis Function Network (RBFN) - connected in a tapped-delay line structure 
(for example, see "Adaptive Noise Reduction Using a Cascaded Hybrid Neural 
Network", E. Chau, MSc. Thesis, School of Engineering, University of Guelph, 
2001. The neural network can also be used to provide integrated functions of the 

10 ANC, the beamforming filter and other signal processing algorithms in the sub-band 
signal processing system. 

While the present invention has been described with reference to specific 
embodiments, the description is illustrative of the invention and is not to be construed 
as limiting the invention. Various modifications may occur to those skilled in the art 
15 without departing from the true spirit and scope of the invention as defined by the 
appended claims. 
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What is claimed is: 



1 . A directional signal processing system for beamforming a plurality of 
information signals, said directional signal processing system comprising: 

5 a plurality of microphones; 

an oversampled filterbank comprising at least one analysis filterbank for 
transforming a plurality of information signals in time domain from the microphones 
into a plurality of channel signals in transform domain, and one synthesis filterbank; 
and 

10 a signal processor for processing the outputs of said analysis filterbank for 

beamforming said information signals, 

the synthesis filterbank transforming the outputs of said signal processorto a 
single information signal in time domain. 



15 2. A directional processing system as claimed in claim 1, wherein said transform 
domain is a frequency domain. 



3. The directional processing system as claimed in claim 1 or2 further comprises 
at least one of any of the following: 

20 a post-filter provided between said signal processor and said synthesis 

filterbank; 

a controller for controlling said post-filter; 

a voice activity detector; 

a target-to-jammer ratio estimator; 
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a signal-to-noise ratio estimator; 

an analog-to-digital convenor for converting said information signals to a 
plurality of digital information signals for supplying said digital information signals to 
said analysis filterbank; 

5 a digital-to-analog converter receiving the outputs of said synthesis filterbank 

for converting a digital information signal to analog information signal; 

a combination matrix provided between said analog-to-digital converter and 
said analysis filterbank for pre-processing of said information signals in time domain; 

an active noise processor comprising a microphone and a loop filter. 

10 

4. The directional processing system as claimed in claim 1, wherein the analysis 
filterbank applies at least one fractional delay impulse response to at least one 
filterbank prototype window. 

15 5. A directional processing system as claimed in claim 3, wherein said controller 
controls said post-filter based on the outputs of at least one of any of the following: 

said voice activity detector; 

said target-to-jammer ratio estimator; 

said signal-to-noise ratio estimator. 

20 

6. A directional processing system as claimed in claim 3> wherein said 
combination matrix is a FIR filter. 
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7. A directional processing system as claimed in claim 3, wherein said 
combination matrix is an IIR filter. 



8. A directional processing system as claimed in claim 1 , 2 or 3, wherein said 
5 signal processor further comprises: 

at least one multiplier for multiplying the outputs of said analysis filterbank 
with at least one weight factor; and 

at least one summation circuit for summing the outputs of said multiplier to 
form the channel signals. 

10 

9. A directional processing system as claimed in claim $ wherein said signal 
processor further comprises an adaptive processor for adjusting said weight factor. 



1 0, A directional processing system as claimed in claim 9, wherein said adaptive 
1 5 processor adjusts said weight factor based on the outputsof at least one of any of the 

following: 

a voice activity detector; 

a target-to-jammer ratio estimator; 

a signal-to-noise ratio estimator; 

20 

11. A directional processing system as claimed in claim 1 , 2 or 3, wherein said 
signal processor further comprises: 

at least one fixed beamformer receiving the outputs of said analysis filterbank 
for beamforming said information signals with a specific beampattem; and 
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at least one multiplier for multiplying the outputs of said fixed beamformer 
with at least one weight factor. 

12. A directional processing system as claimed in claim 1 1, wherein said signal 
S processor further comprises at least one of any of the following: 

a summation circuit for summing the outputs of said multiplier to form the 
channel signals; 

an adaptive processor for adjusting said weight factor. 

10 13. A directional processing system as claimed in claim 1 1, wherein at least one 
fixed beamformer comprises a circuit for processing the channel signals for achieving 
approximately linear phase response within the channel, the circuit apples one or 
more filter to at least one channel signal. 

15 14. A directional processing system as claimed in claim 1 % wherein the filter is an 
IIR filter. 



1 5. A directional processing system as claimed in claim 1,2 or 3, wherein said 
signal processor further comprises: 

20 at least one multiplier for multiplying the outputs of said analysis filterbank 

with at least one beamforming Alter tap; and 

at least one summation circuit for summing the outputs of said multiplier to 
form die channel signals for beaforming said information signals. 
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16. A directional processing system as claimed in claim IS, wherein said signal 
processor further comprises at least one of any of the following: 

an adaptive processor for adjusting said beamforming filter tap; 

a circuit for processing a plurality of channel signals to achieve approximately 
5 linear phase response within the channel, the circuit applying at one or more filter to 
at least one channel signal; 

a processor for dividing the outputs of said analysis filterbank such that at 
(east one channel signal can be processed differently from the other channel signals. 



10 17. A directional processing system as claimed in claim 1 6, wherein said circuit 
comprises an IIR filter. 



1 8. A directional processing system as claimed in claim 1$ wherein said 
processor for dividing the outputs of said analysis filterbank includes at least one 
1 5 high-pass filter and at least one low-pass filter. 



19. A directional processing system as claimed in claim 1$ wherein said 
summation circuit receives the outputs of said multiplier and at least one of any of the 
channel signals that have been processed differently. 

20 

20. A directional processing system as claimed in claim 1 or 2, wherein said signal 
processor comprises at least one of any of the following: 

a neural network receiving the outputs of said analysis filterbank; 

a multiplier for multiplying the outputs of said neural network with the outputs 
25 of said analysis filterbank; 
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a summation circuit for summing the outputs of said multiplier to form a 
plurality of channel signals; 

a post-filter provided between said summation circuit and said synthesis 
filterbank; 

5 a controller for controlling said post-filter. 



21 . A directional processing system as claimed in claim 20t wherein said neural 
network is a Cascaded Hybrid Neural Network. 



10 22. A method of processing a plurality of channel signals for achieving 

approximately linear phase response within the channel, said method comprising the 
step of performing filtering by applying one or more filter to at least one channel 
signal. 



15 23. A method of processing a plurality of channel signals as claimed in claim 22, 
wherein said filter is an IIR filter. 



24. A method of processing at least one information signal in time domain for 
achieving approximately linear phase response, said method comprising the step of 
20 performing an oversampled transformation using at least one oversampled analysis 
filterbank, said oversampled analysis filterbank applying at lease one fractional delay 
impulse response to at least one filterbank prototype window. 



25 
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